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Abstract. We present the results of a model independent analysis, which systematically scans the data taken 
by CMS for deviations from the Standard Model predictions. Due to the minimal theoretical bias this approach is 
sensitive to a variety of models for new physics. Events with at least one electron or muon are classified according 
to their content of reconstructed objects (muons, electrons, photons, jets and missing transverse energy). A broad 
scan of three kinematic distributions in those classes is performed by identifying deviations from Standard Model 
expectations, accounting for systematic uncertainties. 



1 Introduction 

The start-up of the LHC brings forth a new era of high en- 
ergy particle physics. What we will find is yet unknown, 
however there is a large number of theories predicting pos- 
sible outcomes. Many of those theories are tested by ded- 
icated analyses at CMS and the other LHC experiments. 
However new physics could as well manifest itself in ways 
no-one has yet thought of. Thus we have implemented a 
Model Unspecific Search in CMS: MUSiC. Model unspe- 
cific searches aim to analyse a large fraction of data, sys- 
tematically scanning them for deviations from the Stan- 
dard Model (SM). Therefore the selection cuts are not op- 
timized for any expected new physics signal; however, the 
quality of the measurement is ensured by selecting well- 
measured and well-understood physics objects such as iso- 
lated high-/?j- leptons. Any deviation seen might be a de- 
tector effect, a lack of understanding of the event genera- 
tion and simulation or truly a new physics signal. Similar 
strategies have already been applied successfully at other 
accelerator experiments, like HI at HERA, D0 and CDF at 
Tevatron. 



2 Search Algorithm 

New Physics usually shows up in distinctive final states 
and so the first task is to sort the events. We consider the 
following physics objects: muons (ju), electrons (e), pho- 
tons (y), hadronic jets (jet), and missing transverse energy 
(MET). Events are sorted into event classes, which repre- 
sent a single exclusive final state, depending on their con- 
tent of reconstructed objects. To avoid using the same en- 
ergy entry more than once, objects are removed if they are 
too close to each other: Jets are removed if there are pho- 
tons or electrons nearby, and photons are removed if an 
electron is close. An event class can include up to four 
leptons, two photons and eight jets, with at least one lep- 
ton in each event class. After assigning events to event 
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classes, three kinematic distributions are examined, which 
are promising to spot new physics: 

- Scalar Sum of the Transverse Momentum of all partic- 
ipating objects: Ep T . 

- Combined Invariant Mass M of those objects. For classes 
containing MET, the transverse invariant mass Mj is 
used. 

- Missing Transverse Energy in classes containing MET 
above our predefined threshold. 

Out of these distributions Epj is the most general observ- 
able, sensitive to many new physics models involving heavy 
new particles or modified high-energy behaviour. The in- 
variant mass allows the discovery of new resonances. New 
physics with heavy or highly boosted invisible particles 
will show up in the MET distribution. 

The algorithm systematically scans for deviations, com- 
paring the simulated SM prediction with the measured data. 
This is done by calculating a p-value for each connected 
bin region. Since all distributions are analysed as binned 
histograms, the bin width is adjusted to the resolution of 
the considered variable. It is selected to be the smallest 
multiple of lOGeV which exceeds the expected resolution. 
The p-value is the probability of a discrepancy between 
data and Monte Carlo, which is at least as extreme as ob- 
served, if the hypothesis is true that the Monte Carlo de- 
scribes the data within its uncertainties. The region with 
the smallest p-value contains the most significant discrep- 
ancy and we call it the Region of Interest (Rol). 

While p denotes the probability of each Rol seen in- 
dividually, it cannot be used as a statistical estimator for 
the global significance for such deviation in any region. A 
penalty factor needs to be included to account for the num- 
ber of investigated regions. Using pseudo-experiments, we 
can compute a new probability p of measuring at least one 
deviation in any region in this distribution with a lower p- 
value than the one seen in the Rol of the data. The value 
p for a given distribution is then simply the number of 
pseudo-experiments with p pse udo < Pdata, divided by the 
total number of pseudo-experiments. 
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Fig. 1. Sum pT distribution of p of data-vs-SM (black crosses) 
compared to SM-vs-SM (shaded area) 



Fig. 2. Invariant Mass distribution of p of data-vs-SM (black 
crosses) compared to SM-vs-SM (shaded area) 



Both probabilities p and p can be translated into stan- 
dard deviations of normal distributions, which is a com- 
mon way to show significances. As MUSiC is sensitive to 
both an excess and lack of events, a two-sided normal dis- 
tribution is used. 

The reconstruction efficiencies and the misidentifica- 
tion probability are determined from simulation and uncer- 
tainties are applied to cover possible differences to data. 
The uncertainty on the reconstruction efficiency is esti- 
mated with 4% for muons, 3% for electrons and photons 
and 1% for jets. For the uncertainty on the misidentifica- 
tion probability, we use 30% for photons, 50% for muons, 
and 100% for electrons. 
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Fig. 3. Missing Transverse Energy distribution of p of data-vs- 
SM (black crosses) compared to SM-vs-SM (shaded area) 



3 Sensitivity Tests 

To claim the absence of any obvious deviations the search 
needs to have a sufficient sensitivity to new physics. A 
number of tests scenarios has been studied to show that 
MUSiC would be able to successfully point out discrep- 
ancies. One test was to remove top-quark pair production 
process from the SM background and eight classes with a 
significance of 3cr or higher were observed. Another test 
was to assume a 500 GeV Z' (neutral heavy gauge boson) 
as potential signal for New Physics. Here, di-muon and 
di-electron invariant mass distribution exceeded Act level. 
And after assuming SUSY benchmark point LM0 as po- 
tential signal, deviations in a large number of classes were 
clearly visible. 



5 Summary 

The implementation and results of a model independent 
analysis of the 2010 CMS data (at an integrated luminosity 
of 36.1/pb) have been presented. Studies have been per- 
formed to evaluate the sensitivity of the approach to cer- 
tain kinds of scenarios. MUSiC illustrates good agreement 
between data and SM expectation within uncertainties. 



4 Results 



The analysis has been applied to the data taken by CMS 
in 2010 and no significant deviations have been observed. 
Overall 287 distributions in 118 classes have been anal- 
ysed. The distributions of the p of all analysed event classes, 
separated by kinematic distribution, i.e. Scalar Sum of the 
Transverse Momentum (Fig. 1), Invariant Mass (Fig. 2) 
and Missing Transverse Energy (Fig. 3) are shown here. 
The shaded area is the SM simulation and the crosses rep- 
resent the data, and no unexpected deviations are found. 
The complete results for all the classes can be found at: 
https://cms-project-music.web.cern.ch/cms-project-music. 



